Multiple-copy state discrimination: Thinking globally, acting locally 
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We theoretically investigate schemes to discriminate between two nonorthogonal quantum states 
given multiple copies. We consider a number of state discrimination schemes as applied to nonorthog- 
onal, mixed states of a qubit. In particular, we examine the difference that local and global opti- 
mization of local measurements makes to the probability of obtaining an erroneous result, in the 
regime of finite numbers of copies A*', and in the asymptotic limit as N ^ oo. Five schemes are 
considered: optimal collective measurements over all copies, locally optimal local measurements in 
a fixed single-qubit measurement basis, globally optimal fixed local measurements, locally optimal 
adaptive local measurements, and globally optimal adaptive local measurements. Here an adaptive 
measurement is one in which the measurement basis can depend on prior measurement results. For 
each of these measurement schemes we determine the probability of error (for finite A'^) and scaling 
of this error in the asymptotic limit. In the asymptotic limit, it is known analytically (and we verify 
numerically) that adaptive schemes have no advantage over the optimal fixed local scheme. Here we 
show moreover that, in this limit, the most naive scheme (locally optimal xed local measurements) 
is as good as any noncollective scheme except for states with less than 2% mixture. For finite A*', 
however, the most sophisticated local scheme (globally optimal adaptive local measurements) is 
better than any other noncollective scheme, for any degree of mixture. 

PACS numbers: 03.67.Hk, 03.65.Ta 



I. INTRODUCTION 

Understanding the principles governing the process of 
measurement has the potential to illuminate fundamental 
questions and practical notions of the behaviour of the 
physical world. Yet despite the exceptional experimen- 
tal success of quantum theory, many questions regarding 
the fundamental principles of quantum measurement re- 
main. A key aspect of this is the restriction imposed 
by quantum measurement that a measurement of an un- 
known quantum system cannot usually reveal complete 
information about that system. The consequences of this 
become evident in the problem of discriminating between 
two nonorthogonal quantum states [T . Consider a quan- 
tum system that was prepared in one of two known states, 
but we do not know which one. The task is to determine 
which of the two preparations took place. If we consider 
only pure states, we may represent the two possibilities 
as and \ip-). Except for orthogonal states (where 
KV'-t-IV'-)! = 0) there is no measurement that could be 
applied that will deterministically find the state of the 
system without error 

Despite this, it is possible to construct a measurement 
that always determines the correct state, but doing so 
produces a nonzero probability of obtaining an inconclu- 
sive result [P-T. Alternatively, one can construct a mea- 
surement in which all results are conclusive, but neces- 
sarily possessing some probability of error, which is to be 
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minimized [TJ [5] . This minimum-error- probability mea- 
surement is known as the Helstrom measurement, pos- 
sessing an error probability C dependent upon the over- 
lap of the states, K'i/'-t-lV'-)!- 

Should additional copies of the unknown state be avail- 
able [5], one can devise measurement schemes that ex- 
ploit these additional copies to achieve a lowered error 
probability. An example is the simple "majority vote" 
scheme, where the Helstrom measurement is applied to 
each of N copies individually, and the overall result cho- 
sen according to which state corresponds to the majority 
of individual results. The probability of error in such a 
scheme scales as C™'^^ oc 

In comparison, the collective Helstrom measurement, 
performed at once on the collective state of all copies, 
|V'±)'*^! achieves a probability of error C"^°' = (1 — 
v/1 - |(V'+|V'-)P^)/2. While this strate gy is strictly op- 
timal, with C™' scaling as \{ip+\4'-)\'^^ for iV — >• cx), it 
is not in general obvious whether the Helstrom measure- 
ment on iV > 1 copies can be performed using only local 
measurements, making it difficult to practically realize 
using current experimental techniques. 

For this special case of discriminating pure nonorthog- 
onal quantum states, |?/'+) and it is possible to 
achieve the same scaling of error probability as the col- 
lective Helstrom measurement by using a fixed local mea- 
surement (as in the majority vote scheme but using a 
different fixed measurement) applied to all N copies [7]. 
Furthermore, by performing local measurements on each 
copy adaptively, choosing the measurement bases to min- 
imize the error probability at each stage (thus being "lo- 
cally optimal" adaptive measurements), it is possible to 
precisely achieve the optimal performance as defined by 
the collective Helstrom measurement for any A*" [71 [5] ■ 
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The above picture becomes much more comphcated 
for mixed states. In this general case, the locally op- 
timal adaptive local measurement scheme can exhibit 
worse performance than the simple majority vote scheme 
when the number of copies exceeds some threshold [5]. 
Clearly, neither scheme is the optimal local measurement 
scheme in the general case. To derive a local measure- 
ment scheme that is globally optimal, with the least prob- 
ability of error for all N, requires an analysis based on 
dynamic programming llOj. This was done in Ref. [9], 
and tested experimentally, for N < 10. 

Here we analyze the performance of a number 
of multiple-copy two-state discrimination schemes, fo- 
cussing on the distinction between local optimization and 
global optimization as it applies to the resulting proba- 
bility of error of these schemes. We consider two classes 
of local measurement schemes: those using identical mea- 
surement bases for each copy (fixed measurements) and 
those in which the measurement bases are changed as 
each measurement result adds new knowledge (adaptive 
measurements). This gives us four schemes for consider- 
ation: locally optimal fixed (LOF) local measurements, 
globally optimal fixed (GOF) local measurements, locally 
optimal adaptive (LOA) local measurements, and glob- 
ally optimal adaptive (GOA) local measurements. For 
comparison, we also consider the optimal collective mea- 
surement (OCM). For each of these schemes we calculate 
the error probability for states with various levels of de- 
polarizing mixture, for finite N. We then determine the 
large N scaling of the schemes by directly calculating the 
corresponding Chernoff bound in each case. Doing so, 
we verify recent theoretical bounds [TTJ [12] placed on the 
asymptotic scaling of local measurement schemes. In ad- 
dition, we analyze the relationship between the LOF and 
GOF local measurement schemes as mixture is varied, 
and find that the latter has no advantage in the asymp- 
totic limit for all qubit states with mixture greater than 
2%. 
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FIG. 1: Probability of error of optimal collective measure- 
ment of up to 10 copies with a = n/6 for various amounts of 
depolarizing mixture i/, and for equally likely states {q = 1/2). 

to each of the N copies of the unknown state. A 
local projective measurement performed in the basis 
{|0)(0|,|0^)(0^|}, where 

10) EE COS0IO) -f sin(/i|l) (2) 

is parametrized by the angle (p G [0, 7r/2). (Due to our 
choice of Hilbert space basis in defining our states p± , we 
can restrict measurements to those with real coeHicients 
in this basis.) Each of the local measurement schemes 
then constitutes a sequence of measurement angles {(f>n}, 
where 1 < n < N . Given the state parameters v and 
c = cos a, Bayesian inference can be used to optimally 
analyze the measurement results and make a final deter- 
mination of that state, with some probability of error, 
C = C^(i.,c). 

A. Optimal Single-Copy and Collective 
Measurement 



II. DISCRIMINATION SCHEMES 

Consider a pair of pure qubit states passing through 
a depolarizing channel p^. Let a < it/ 2 he the angle 
separating the two states in Hilbert space {a = tt/2 thus 
corresponds to orthogonal states). Without loss of gen- 
erality we may represent these states as 

/5± = (1 - J^)(i'cosa± Asina)] (1) 

where X and Z are the Pauli operators, and ly e [0, 1] 
quantifies the level of mixture applied by the channel. 
Given a collection of N identical copies of either the /)+ or 
the p- state (with equal probability) , our task is to then 
determine, conclusively and with minimal probability of 
error, which state was given. 

The four local discrimination schemes are defined by 
a sequence of local projective measurements applied 



For any number of copies N > 1^ the minimum pos- 
sible error probability C can be obtained in principle 
by measuring all N copies of the state using a collective 
measurement. A collective measurement is one that, in 
general, cannot be achieved by local measurements, even 
allowing for the measurement basis to be chosen adap- 
tively (i.e. based on prior results). The optimal collective 
measurement for any N can be performed by measuring 
the observable f = qpf^ - (1 - g)/5?^ [B H] and guess- 
ing the state as p+ or /5_ corresponding to the sign of the 
result obtained. Here q is the prior probability of the sys- 
tems being in the p^ state. The resulting probability of 
error, known as the Helstrom lower bound, is found to be 
<o 7j J where "fj are the eigenvalues 

off. 

Given the conditions a and we can compute the 
eigenvalues of the operator F and determine the prob- 
ability of error. Some examples for a = 7r/6 and var- 
ious ly are shown in Fig. fl] When 1^ = (that is, for 
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FIG. 2: Single-copy state and measurement on a Bloch sphere 
cross-section. The states p+ and /5_, separated by a Bloch- 
space angle of 2a (corresponding to an angular separation of 
a in Hilbert space, as used in the text; here a = 7r/6), are de- 
picted with = 0.1 depolarizing mixture. The measurement 
basis shown, here with (/> = 7r/4, is referred to as an 'unbiased' 
measurement. In the case of equal prior probability (g = 1/2), 
this is the optimal single-copy measurement. 
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FIG. 3: Single-copy state and measurement on a Bloch sphere 
cross-section, with states p+ and p_ with = 0.1 depolarizing 
mixture. The measurement basis shown is with </> = a/2, 
optimal in the limit q — > 1. This and the opposite case where 
= 7r/2 — a/2 (optimal for g — > 0) are referred to as 'fully 
biased' measurements. 



pure states) the eigenvalues of f can be determined an- 
alytically, and the optimal collective measurement error 
probability C'^^^ is given by 



/-<OCM| 



o = Mi-Vi-49(i-g)e 



,2^ 



(3) 



where c (that is, cos a) in this pure state case can also 
be written c — KtA+I"!/)-)!. 

The optimal single-copy measurement (OSM) is the 
special case iV = 1. This is a projective local measure- 
ment, independent of v due to the invariance of the di- 
rection of eigenvectors under depolarization. The mea- 
surement angle is given by 



^oSM(-^^ ^ 1 arccot[(29 - 1) cot a\. 



(4) 



For the case of equal prior probabilities (g — 1/2) 
the measurement angle ^^SM jg .^y^^ with I'/') (01 and 
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FIG. 4: Probability of error of locally optimal fixed local mea- 
surements of up to 10 copies with a = 7r/6 for various amounts 
of depolarizing mixture v. 



\ symmetric about the state vectors p± (see, e.g., 
Fig.[2|. The probability of error is 



C 



OSM 



I - (1 - v)\J\ - c2 



(5) 



for all V e [0,1]. 

We note that in the case of unequal prior probabili- 
ties, q ^ 1/2, the optimal single-copy measurement an- 
gle '}-,iaggg' towards the more likely state. As q in- 
creases from 1/2, the measurement angle decreases such 
that, in the limit of complete prior determination (7 = 1, 
^OSM _ (yj2^ with 10) (01 lying in the direction of pj^ (see 
Fig. [s]). Similarly, for g = 0, 0°^" = 7r/2 - a/2, placing 
10^) (0^1 in the direction of /5_. We refer to the measure- 
ments in these limits as being 'fully biased', and the case 
where = 7r/4 as being 'unbiased'. 



B. Locally Optimal Fixed Local Measurements 

For any locally optimal scheme, the optimization is 
done by minimizing the error probability of the measure- 
ment of only a single copy. The measurement angles are 
therefore defined by (^osm (dependent on q). Here we 
consider the case where each measurement is also per- 
formed independently, i.e. without any information ob- 
tained from the measurement of other copies. Given 
a, and q — 1/2, all constants, it follows that the mea- 
surement bases of such a scheme are fixed for all copies: 

0LOF ^ ^/4^ 

The LOF scheme reduces to a simple binomial decision 
problem, with probability of error 
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N 



LW/2J 
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l-C 



OSM\ 



OSM 



N- 



(6) 



for N odd. Because the measurements are fixed and sym- 
metric about the states p± , for N even there is the addi- 
tional possibility of obtaining measurement results such 
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that Bayesian analysis equally favors both states. In this 
case, we may arbitrarily choose the overall discrimination 
result, because the equal a priori likelihood of the states 
ensures that any choice will lead to the same probability 
of error. For example, we can randomly choose the out- 
come. Equivalently, we may choose the state that would 
have been chosen if the final measurement had not been 
made, i.e. by considering only the first — 1 measure- 
ment results. In doing so, it is evident that the prob- 
ability of error for the LOF scheme will not differ from 
any odd — 1 copies to an even TV copies, regardless 
of our method for choosing the overall outcome in this 
situation. This results in a stepwise decrease of the error 
probability as the number of copies increases, examples 
of which can be seen in Fig. [4j where various levels of 
depolarizing mixture v are applied to the state. 
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FIG. 5: Angles of globally optimal fixed local measurements 
^GOF "i^g gQpjgg ^jth a = 7r/6 for various amounts 

of depolarizing mixture v. 



C. Globally Optimal Fixed Local Measurements 

To achieve global optimization it is necessary to 
consider the error probability that arises after all N 
copies have been measured. One might consider a di- 
rect approach, calculating the total error probability by 
summing over the set of possible outcomes where the 
Bayesian posterior probability is satisfied (i.e. greater 
than one half, for each outcome), similar to Eq. ([6]), and 
optimizing over the measurement angle. 

Instead of this approach, however, we present an al- 
ternative approach based on dynamic programming |10j . 
While more complicated than a direct summation, this 
approach is considerably more powerful, and will be nec- 
essary in following sections. For pedagogical reasons, we 
introduce it here for the easiest to understand case: a 
fixed scheme. 

The dynamic programming approach allows us to con- 
struct a recursive analytic expression for the resulting 
probability of error. Suppose all TV copies are measured 
in the basis defined by the angle 0, and Bayesian analysis 
is applied to the first n < N measurement results. We 
obtain a value Pn+i that quantifies the credulity that the 
prepared state is p+ at this stage. 

Let us define i?„ as the probability of error we expect to 
obtain after measuring the remaining N — n copies. For 
n = N, this is a simple function of the final credulity, 
Rn{Pn+i) = niin(PAr+i,l — P^r+i). For general n, the 
expected error probability at the (n — l)th measurement 
can be determined from the expected error probability at 
the nth measurement, as 



Rn-l{Pn) = ^Pr[7^„|P„,.^]i?„(P„+i), 



(7) 



where Dn represents the possible outcomes of the 
measurement of the nth copy and Pr[Z?„|P„, — 
Pr [Pi„ I p+ , 0] P„ Pr [P>„ I /3_ , (A] ( 1 - P„ ) . In order to eval- 
uate P„, the credulity P„+i can be calculated from the 
credulity P„ and measurement outcome Z3„ by applying 
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FIG. 6: Probability of error of globally optimal fixed local 
measurements of up to 10 copies with a — n/6 for various 
amounts of depolarizing mixture v. 



Bayes' theorem. 



P 



n+l 



Pr[Dn\p+,^]Pn 

Pr[D,,\P^,(f>] 



(8) 



Beginning with n = N and progressing backwards to- 
wards n = 0, we can thus calculate the expected proba- 
bility of error for any n by evaluating Eqs. ([t]) and ^ at 
each step using 

Pr[A.|p±, = ^ + (1 - ^) cos^ (0 - ^(A. - 1) T f ) . 

(9) 

In doing so we traverse a binary tree in which all possible 
permutations of measurement results are visited. The 
probability of error for TV copies of the state is therefore 
given by 



CjY — Ro{Pi), 



(10) 



where Pi = q = 1/2. 

Armed with an expression for the probability of error 
for all N copies, we can determine the globally optimal 
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FIG. 7: Probability of error of locally optimal adaptive local 
measurements of up to 10 copies with a = 7r/6 for various 
amounts of depolarizing mixture i/. 



fixed measurement angle by simply minimizing over (j). 
In this case, the optimal measurement angle, 0^'-'^, will 
depend on N and as well as a. Figure [5] shows exam- 
ples of this globally optimal fixed measurement angle for 
various amounts of mixture. We see that for A'^ > 1 in the 
pure state case, a pair of measurement angles are equally 
optimal, becoming increasingly biased towards one or the 
other states as N increases. 

For states with appreciable mixture, the measurements 
oscillate between being unbiased for odd N, and some- 
what biased for even N. In doing so, the scheme avoids 
the possible condition of obtaining equal credulities for 
the states following analysis of all measurement results. 
A mixture of = 0.02 appears to be an intermediate 
regime (but see Sec. Ill I. Figure |6] shows the error prob- 
abilities for these measurements. 



D. Locally Optimal Adaptive Local Measurements 

For the pure state case, v = 0, it has been shown the- 
oretically and demonstrated experimentally [S] that, 
with measurements allowed to vary adaptively, one can 
achieve an error probability exactly equal to that of the 
collective measurements for a simple, locally optimal, 
adaptive scheme. That is. 
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r,2N 



(11) 



Here each copy is measured in sequence using the locally 
optimal single-copy measurement given the Bayesian 
credulity at each stage, Pn p.^- This measurement basis 
is defined by the angle 4>]f^^ = (/)°^^(P„). The measure- 
ment result is used to determine an updated credulity 
P„+i, following Eq. ([s]). This is subsequently used to 
find the (locally optimal) angle for the next measure- 
ment, and so on in turn for each of the N copies. 

For the mixed state case, > 0, one could consider a 
direct approach, in a similar fashion as may be considered 
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FIG. 8: Ratio of the error probability of locally optimal adap- 
tive local measurements and the error probability of globally 
optimal fixed local measurements, jC'^'^^ , for various 

mixtures v. Values above 1 indicate the LOA scheme ex- 
hibiting error probabilities greater (i.e. worse) than can be 
obtained using the GOF scheme. 



for the GOF scheme, by evaluating the error probability 
over the entire set of measurement outcome strings. Do- 
ing so might be feasible for the moderate N considered 
here, but the complexity of the computation increases 
exponentially, making such an approach intractable for 
the large iV considered in Sec. |III[ 

The dynamic programming approach described in the 
previous section can also be used to calculate the error 
probability of this scheme. Here, however, an expression 
for the measurement angle is already well-defined, given 
the credulity P„ at the nth copy (which can be calcu- 
lated). The overall probability of error for this scheme 
can therefore be calculated without requiring a numerical 
optimization. Examples are illustrated in Fig. [7j 

The optimality of the LOA scheme (in terms of low- 
est overall error probability) does not hold for > 0. 
For certain v and N the LOA scheme results in a higher 
probability of error than a fixed scheme [9] . Figure [s] il- 
lustrates this by comparing the LOA scheme to the GOF 
scheme. The error probability for each scheme decreases 
for each newly measured copy (as each measurement pro- 
vides information) however there appears a complicated 
relationship in the ratio of error probabilities of each 
scheme. The ratio is not monotonic in v for 'N fixed, 
nor is it monotonic in N for v fixed. 



E. Globally Optimal Adaptive Local Measurements 

To determine the measurements necessary to achieve 
the lowest error probability possible using local measure- 
ments in the general case, we must perform global opti- 
mization in a manner similar to that done for the GOF 
scheme. Relaxing the condition that </> is fixed, the mea- 
surement angle for the nth measurement becomes a func- 
tion of n and the credulity P„, i.e. <^n{Pn)- To consider 
the potential values of P„ between and 1, we construct 
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a table of measurement angles (f>^'~'^{Pn) with a large 
number s of linearly spaced samples of < P„ < 1. For 
intermediate values of P„, the angle 'p^'^^ is estimated 
by cubic interpolation of nearest-neighboring samples. 

Bellman's principle of optimality [101 HI] requires that 
the final stages of an optimal scheme, whatever the condi- 
tions are when those stages commence, must necessarily 
themselves be optimal under those conditions. Adhering 
to this principle, we construct the table of measurement 
angles by first recognizing that the final measurement 
angle (I}%°^{Pn) = ^°^^(PAr), the optimal single-copy 
angle, as there is only one copy available (the final copy) 
to measure at this stage. Thus, entries in the final (iVth) 
column of the measurements table represent the optimal 
single-copy measurement angles (j)'^^^{PN) for all s sam- 
ples of P/v. 

From this we determine the preceding measurement 
angle, (j)%°i{PN-i), by minimizing i?jv-i(PAr-i, 0^°^), 
given Pn-1- We do this for each Pn-i sample, defining 
the (A^ — l)th column of the measurements table repre- 
senting This then defines the optimal penultimate 
measurement for whatever credulity Pn-i may be found 
by that stage. The {N — 2)th measurement can be de- 
termined similarly by minimizing Rn~2{Pn~2, ^^r 
all samples of Pn-2: and so on for the remaining copies. 
Satisfying the optimality principle at every stage of this 
reverse construction guarantees that the entire measure- 
ment sequence is, of all possible local measurement se- 
quences, the globally optimal one. After constructing 
the final column of the measurements table, we find the 
probability of error C^'^^ = Ro^^iq)- 

It is important to note that the backwards optimizing 
construction of the measurements table ensures that, for 
any n, the final n measurements are globally optimal. It 
follows that the values of R^'^^ obtained by minimiza- 
tion during this backwards process equal the forwards 
probabilities of error C^°^ for each N — n. Making use 
of this fact simplifies the calculation of the measurement 
angles and error probabilities for multiple (consecutive) 
values of A^. 

Figure |9] illustrates the globally optimal adaptive mea- 
surement angles in a mixture regime for up to ten copies, 
calculated using s = 2501 samples in the measurement 
and error probability tables. We found this to be suf- 
ficiently accurate as increasing the number of samples 
made no discernible difference to the outcome for all 
cases considered here. For large A^, the initial measure- 
ments are approximately unbiased, that is, (f) ~ 7r/4, in- 
dependent of the credulity P„. A complex pattern of 
measurement angles becomes apparent as more copies 
are measured, with the final measurement being the op- 
timal single-copy measurement by definition. That is, 
<^(P„) = '/'°™(^n) of Eq.|4l 

Also illustrated in Fig. |9] are six example trajectories, 
found by simulating the GOA scheme with measurement 
results generated randomly according to the outcome 
probabilities as a function of the state and measurement 
angles. The actual state is p+ in each case plotted. As 
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FIG. 9; Measurement angles (f} for globally optimal adaptive 
local measurements of up to 10 copies with a = n/6 and i/ = 
0.1. As the scheme acquires measurement results it updates 
the credulity Pn, and chooses the subsequent measurement 
setting corresponding to the shade at the intersection of Pn 
and n+ 1. The lines illustrate examples of this measurement 
sequence. 
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FIG. 10: Probability of error of globally optimal adaptive lo- 
cal measurements of up to 10 copies with a = tt/6 for various 
amounts of depolarizing mixture i/. 



each copy is measured, the credulity moves in a direction 
corresponding to the measurement result. In some cases 
the measurement results are such that the direction is 
away from the true state. In the average case (and in 
the asymptotic limit) the credulity P„ increasingly cor- 
responds to the true state. 

The probability of error for the GOA scheme is plotted, 
for various levels of mixture, in Fig. |10[ For pure states 
this scheme reduces to the LOA scheme, achieving the 
same error probability as the optimal collective measure- 
ment. For mixed states the GOA scheme does not achieve 
the error probability of the optimal collective measure- 
ment, but no other local measurement scheme exhibits 
lower probability of error. 
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III. IN THE LIMIT OF LARGE N 

As the number of copies grows large, i.e. — >■ oo, the 
asymptotic behaviour of the error probabihty is described 
by a function decreasing exponentially in N |15) . 



C 



N 



The ChernofF bound, 



^ = — lim 



-CAT 



logC 



N 



N 



(13) 



corresponding to a given measurement scheme, is a con- 
stant quantifying the asymptotic scaling of the error 
probability, with higher Chernoff bound equating to 
faster scaling. 

Some notable theoretical results already exist in this 
asymptotic regime. It is known that fixed local measure- 
ments are capable of achieving asymptotic scaling equal 
to that of adaptive local measurements [TT]. This result 
was recently corroborated by an approach in which the 
error probability of state discrimination is reformed as a 
semidefinite programming problem, and for which strict 
bounds on the error probability were calculated [H] using 
efficient numerical algorithms fTE|. Reference [H] used 
the same method to investigate other schemes similar to 
those presented here. 

In the following sections we directly calculate the large 
N behaviour for each scheme considered in the previous 
sections. We consider optimal schemes under the con- 
dition of local and collective measurements given states 
under depolarizing mixture, and thereby provide a di- 
rect verification of the asymptotic results of Refs. [TT] 
and \m. 



A. Optimal Collective Measurement 

For equally likely states p+ and p- , the Chernoff bound 
for optimal collective measurements (i.e. the quantum 
Chernoff bound) is P 



OCM 



a -1 — ai 



log min Tr[/5^/5J 

0<a<l 



(14) 



We can write the states in a diagonal representation, each 
as a mixture of a pair of orthogonal pure states a± and 
a^, with p± = (1 — iy/2)a± + {v/2)a^. Substituting, we 
find 



^OCM ^ _log^minJr{[(l - i./2)"<T+ + (j./2)"a:^] 

X [(1 - i./2)i-"a_ + (i./2)i-"^^] } (15) 
= - log min (c^ + (1 - c^) \{l - i//2)°(iy/2)i-" 

0<a<l L L 

+ {l-u/2f-^{v/2r]]. (16) 



The minimum, which is unique due to convexity [B], is 
satisfied for a = 1/2. The ChernofF bound for the optimal 
collective measurement of mixed states p± is therefore 



OCM 



= - log [l - (1 - c2) (l - 



(17) 

(12) For pure states {v ~ 0) this reduces to —2 log c, consistent 
with the error probability scaling C'^'^^ ~ c^^ found by 
taking Eq. (IS]) in the limit of large N (with equal prior 
probabilities)? 



B. Locally Optimal Fixed Local Measurements 

We can rewrite the LOF error probability Eq. ([6]) as 



C 



LOF 

N 



OSM 



n=0 ^ 



(JO'S.M 

For large N this has the same scaling as the integral 



(18) 



,-tLOF 



(c 



osM^ 



1 _ (-fOSM \ " 

1 dn. (19) 



(JOSM 



When n is near A^/2, the binomial coefficient function 
varies slowly (approximately as a Gaussian of variance 
7V/4), but as n approaches N/2, [(1 - c"='S'^)/C°SM]n 
grows exponentially. The scaling is therefore dominated 
by this term, and the error probability can be approxi- 
mated as 



^LOF 



OSM\^ ( N 

' \N/2 



N/2 



1 - c 



OSM 



dn 



(20) 



.OSM^N N\ [(1 _ COSM)/cOSM]JV/2 



(C°™)^ 



[(Ar/2)!]2 log[(l - COSM)/COSM] 



(21) 

To determine the LOF Chernoff bound ^'"OF take 



the natural logarithm of Eq. (21 1, 

(iV/2) l0g[(l - COSM)/cOSM] ^ ^ ^OSM 

+ log(A^!) - 2 log[(iV/2)!] + const., (22) 



logC^OF 



where terms constant in N have been omitted. Using 
Stirling's approximation, this becomes 



logC 



LOF 

N 



- COSM)C^OSM 



(1/2) log + const. 



(23) 



Substituting into Eq. ([13| and taking the limit — > oo 
we obtain 



il/2)\og[l-{l-vf{l-^)\ 



(24) 
(25) 



For pure states this reduces to — logc for a probability 
of error scaling as C^*-*^ ^ c^, quadratically worse than 
C'^^'^ ~ c^^, consistent with the known scaling [TlH]. 
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FIG. 11: Angles of globally optimal fixed local measurements 
0GOF £qj. unlimited number of copies with a = 7r/6 as the 
amount of depolarizing mixture v varies. For pure states there 
are two equally optimal measurements, fully biased such that 
in each case one projection is in the direction of one of the 
states p±. As mixture increases, the bias of the measurement 
decreases. At the critical point v'^''^^ the globally optimal mea- 
surement angles equal. For any amount of mixture beyond 
this point the globally optimal fixed measurement angle is 
7r/4. 



C. Globally Optimal Fixed Local Measurements 

It has previously been shown |17j that the scaling of 
the GOF scheme can be quantified by considering the 
classical ChernofF bound [15' applied to fixed local mea- 
surements of each independent copy. Doing so for states 
P± gives 



GOF 



log min min Mia. 6), 

0<a<l O<0<Tr/2 



(26) 



where 

M{a,c^) = (Tr[|^)(<^|p+]r {TmmAf-" 

+ (Tr[|0^)(</-^|p+])"(Tr[|0^)(0^|p_]) 



l-a 



(27) 



We numerically minimize the function M{a,(l)), thus 
determining given the parameters v and a that 

define the states p±. An example is shown in Fig.[TT]for a 
fixed a. For pure states we find that the globally optimal 
fixed measurement is fully biased — that is, = a/2 or 
(f)-^ — — 7r/2 = —a/2, such that one of the measurement 
vectors lies in the direction of one of the two states. This 
is consistent with what we might expect to find given the 
results plotted in Fig. [5] for finite N. It is also consistent 
with the proof given in Ref. [7 that, for pure states, fully 
biased measurements exhibit error probability with the 
same scaling ^ as optimal collective measurements. 

For states with sufficient mixture v, we find the optimal 
fixed measurement is unbiased, equivalent to the LOF 
scheme. This confirms the results presented in Ref. 
that showed unbiased measurements performing better 



0.02 



0.015 



0.01 



0.005 




a (radians) 

FIG. 12: The minimum amount of depolarizing mixture for 
which the globally optimal fixed local measurement in the 
asymptotic limit is (j)^'-'^ = 7r/4 (critical mixture, v'^'^^^) as 
the separation between the states a varies. 



than other schemes, given enough copies of sufficiently 
mixed states. 

As the purity of the states increases, the optimal 
fixed measurement angle bifurcates at a critical mixture 
I/"" For mixture > z^"'*, ^^O'" = n/4. For mix- 
ture !/ < J^'^'''' there exists two unique and equally optimal 



fixed measurement angles related by (f)f-'^ = tt/2. 



In Fig. 



11 



the bifurcation is found at ly"^^ « 0.012. As 
the mixture decreases, the bias of the optimal measure- 
ment increases monotonically. 

We find that different separations of the states a ex- 



hibit different critical mixtures. Figure 12 shows the de- 
pendence of v"^^ on a, obtained by numeric minimiza- 
tion. The first notable fact is that there is an absolute 
upper bound on j/'^"*, of about 0.0188. That is, there is an 
error threshold of about 2% depolarizing noise, beyond 
which there is no advantage asymptotically to using any 
local strategy beyond the simplest (locally optimal fixed) 
one, in which the final guess is determined by whichever 
result occurred in the majority of measurements. 

The second notable fact is that i/"^^ attains this up- 
per bound at an intermediate degree of nonorthogonality 
of the pure states (prior to depolarization). As the two 
pure states approach orthogonality (a = 7r/2) or iden- 
tity (a = 0), the value of i/'^"^ is zero. That is, any 
degree of mixture added to such states reduces the op- 
timal solution to a repetition of the optimal single-qubit 
measurement. In the limit a — > 7r/2, this result is not 
surprising, as in this limit for pure states the fully bi- 
ased measurement (which is the globally optimal fixed 
measurement) becomes the same as the unbiased mea- 
surement, because the states become orthogonal. But 
in the other limit, a — >■ 0, these two measurements are 
as different as they can be, so the discontinuous change 
in the optimal measurement as mixture is added is sur- 
prising, and may even be thought paradoxical. What 
must be borne in mind is that the asymptotic calcula- 
tion makes no reference to how many copies are required 
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FIG. 13: Difference of nearby log error probabilities as a func- 
tion of A^, approaching ^ as grows large. Values are calcu- 
lated for a = 7r/6 and v = 0.02 using the table sampling and 
interpolating approximation method with s — 2501 samples. 
AA'^ = 2 is used to avoid large variations due to the stepwise 
nature of the LOF error probabilities. 



to approach the asymptotic regime. For almost identical 
states, ^'"OP is very small, reflecting the fact that it is 
very difficult to distinguish the states. From Eq. (23) we 



can see that the asymptotic regime will be reached only 
when iV/log7V > 1/^^°^. That is, as a ^ 0, the N re- 
quired to reach the asymptotic regime diverges. Thus for 
any fixed N, no matter how large, as a — > the problem 
is necessarily non-asymptotic, and the globally optimal 
fixed measurement for pure states (the fully biased one) 
will remain close to the globally optimal fixed measure- 
ment for states with a small amount of depolarizing noise. 



D. Locally Optimal Adaptive Local Measurements 

We may find the Chernoff bound f for the adap- 
tive schemes by extending the dynamic programming 
approach (introduced in Sec. II C I into the asymptotic 
regime. We take the difference of the logarithm of error 
probability between two nearby points in the regime of 
large N, thereby obtaining an estimate of 



dN 



(28) 



For the conditions we consider, we find that this gradi- 
ent reaches a constant (indicating asymptotic behaviour) 
before TV = 400 copies — see, for example. Fig. [13] 

For such large N, calculation using the recursive ap- 



proach as detailed in Sec. II C becomes computationally 
infeasible. To avoid this, we instead employ the table 
sampling and interpolating approximation approach used 
to determine the error probability of the GOA scheme in 
Sec. |II E| For the LOA scheme, the measurement angles 
(j)^'~'^{Pn) are already well-defined, equal to the optimal 
single-copy measurement angle (/)°^^(P„) where, as be- 
fore, Pn is the Bayesian credulity of having the p+ state 



given the results of the previous n — 1 measurements. We 
may then use this definition to calculate the error prob- 
abilities, making use of the fact that R^f^ = C'w=t- We 
do so for up to 400 copies. 

We have found that, due to the limitations of ma- 
chine precision, the interpolating approximation becomes 
problematic when attempting to correctly determine the 
error probabilities for such large N. The problem is 
best understood by considering the results of this ap- 
proach as applied to the LOF scheme (i.e. with predefined 
4'n — (j)^'^^) as compared to the results we expect due to 
the exact expression for the error probability. The sharp 
stepwise pattern of error probabilities, evident in the ex- 
act result for this scheme and which can be seen in Fig. |4j 
becomes less distinct and "washed out" for increasing N 
when calculating using the interpolating approximation. 
The outcome is a value for the error probability that is 
significantly lower than the correct value, and a value 
for the LOF Chernoff bound, significantly greater 

than the value as determined by our analytic derivation 
given in Sec. |IIIB| 

We believe that a similar effect also results in the un- 
dervaluing of LOA error probabilities and overestimation 
of the LOA Chernoff bound. This belief is justified as 
we note that decreasing the number of samples s results 
in an exaggeration of this effect. Conversely, increasing 
s reduces the effect, however the reduction appears to 
be exponentially decreasing in s. Indeed, for the LOF 
scheme, the overestimation is still quite apparent even 
for s = 10001 samples. Increasing s quickly becomes 
computationally infeasible. 

To nevertheless extract an accurate estimate of ^^'^^ 
from these results, we calculate representative samples, 
for various s S {501, 1001, 1501, 2001, 2501, 10001} 
and fit them to the function 



LOA 



y 

(log s) 



(29) 



using a least-squares method. Here, x and y are allowed 
to vary. To maintain stability of the solution, z is fixed 
to the value of 1.22, which is found to work best. 

We finally evaluate the fit function in the limit s — > 
oo; that is, we use S,^^ = x. The result of this is a 
best-guess approximation of the Chernoff bound for the 
scheme. We have found that this approach works well for 
all but the highest-purity states, where the projection 
becomes unstable and the results clearly incorrect (we 
thus omit those results). The results are shown in Fig. 14 
as a function of v. As this shows, applying the approach 
described here to the LOF scheme produces ^^of -j-j^g^-j- 
only a tiny amount R:!0.0024 below the exact analytical 
results, which validates the approach. 



E. Globally Optimal Adaptive Local Measurements 

We find the Chernoff bound for the GOA in the same 
manner as we do for the LOA scheme, by extrapolating 
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schemes. For states of high purity, the LOA scheme has 
better scahng than the LOF scheme, but the converse 
becomes true as the mixture of the states is increased. 
For any significant amount of mixture (here greater than 
approx. V = 0.008), the LOF, GOF, and GOA schemes 
aU agree. While the numericaUy calculated ^'^OA ^ 
tiny bit lower than the analytically calculated ^^^^f 
^GOF^ it lies on top of the numerically calculated 5^^°^, 
as expected. Thus we have confirmed that, for all but 
very pure states, the LOF, GOF, and GOA are all opti- 
mal local schemes in the asymptotic limit, outperforming 
the LOA scheme. 

IV. CONCLUSION 



FIG. 14: Chernofl bounds for various state discrimination 
schemes applied to states with a = 7r/6 and various levels of 
mixture. Results for OCM and LOF schemes are calculated 
analytically, indicated by solid lines (upper and lower, respec- 
tively). Results for the LOA and GOA schemes, as well as 
the LOF scheme for comparison, are calculated using the ex- 
trapolated sampling approximation method. Results for the 
GOF scheme are found by direct optimizing search. The re- 
sults confirm the GOF and GOA schemes possessing the same 
scaling in the asymptotic limit. 



fits of the log error probability gradient in the high-iV" 
regime. Unlike for the LOA scheme, in this case the 
measurements are not well-defined in advance, and must 
be determined by minimizing the error probability. The 
GOA Chernoff bound is calculated for several numbers 
of samples s, and an extrapolation for s — )■ oo gives us a 
final estimate of the GOA Chernoff bound, 

The Chernoff bounds for each of the four local schemes 

7r/6. 



are shown in Fig. 14 for states with a = 7r/6. The 
quantum Chernoff bound of optimal collective measure- 
ments is also shown. Optimal collective measurements 
scale better than any other scheme in all cases except 
for pure states. For pure states, all but the LOF scheme 
have a Chernoff bound of ^ = — 21ogc (this is — logc 
for the LOF scheme). The Chernoff bound of all local 
measurement schemes rapidly degrades as mixture is in- 
troduced. For mixed states, the GOA and GOF schemes 
have equivalent scaling, optimal for local measurement 



We have investigated the probability of error exhib- 
ited by a representative set of local and collective mea- 
surement schemes for multiple-copy state discrimination 
acting on depolarized qubit states. We find that for any 
nonzero amount of mixture, local schemes fail to exhibit 
error probabilities approaching that of optimal collec- 
tive measurements, while the globally optimal adaptive 
scheme introduced in Ref. [9^ outperforms all other local 
schemes. 

We have presented various ways in which we may cal- 
culate the error probabilities and measurement settings 
of the four local measurement schemes for finite iV. In 
addition, we have presented analytical expressions and 
numerical calculations of the asymptotic behaviour (the 
Chernoff bound) for each scheme. These results provide 
a direct verification of the asymptotic scaling behaviour 
predicted of local measurement schemes in works such 
as Refs. [TT] and |12]- An important conclusion of our 
work is that there exists a critical degree of mixture, 
'^max ~ 0.0188, such that, in the asymptotic regime, 
for any pair of states with at least this much depolar- 
ization, the optimal local measurement scheme to dis- 
tinguish them is the simplest: the locally optimal fixed 
measurement scheme. 

We thank Robin Blume-Kohout and Sarah Croke for 
helpful discussions. This work was supported by the Aus- 
tralian Research Council, project DP0986503. 
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